Project MLExAI: Applying Machine Learning to Web Document Classification

Authors

  • Ingrid Russell
  • Susan Coleman
Abstract

We present work on project MLExAI, funded by the National Science Foundation, whose goal is to unify the artificial intelligence (AI) course around the theme of machine learning. Our work involves developing, implementing, and testing an adaptable framework for presenting core AI topics that emphasizes the relationship between AI and computer science. We have developed a suite of adaptable hands-on laboratory projects that can be closely integrated into a one-term AI course and that supplement introductory AI texts. This paper focuses on one of these projects, describes how it meets our goal, and presents our experiences using it. The project involves the development of a learning system for web document classification. Students investigate the process of classifying hypertext documents, called tagging, and apply machine learning techniques and data mining tools for automatic tagging. A summary of our experiences using the projects during four course offerings over the last two years is also presented.

CCSC: Southeastern Conference
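The automatic tagging task described in the abstract can be illustrated with a minimal sketch. The abstract does not say which learning algorithm or tools the project uses, so the multinomial naive Bayes classifier below is an assumption chosen for brevity, and the names `tokenize` and `NaiveBayesTagger`, the tags, and the toy documents are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(html):
    """Strip HTML tags and return lowercase alphabetic word tokens."""
    text = re.sub(r"<[^>]+>", " ", html)
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTagger:
    """Hypothetical multinomial naive Bayes tagger with add-one smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # tag -> word frequencies
        self.doc_counts = Counter()              # tag -> number of documents
        self.vocab = set()                       # all words seen in training

    def train(self, html, tag):
        tokens = tokenize(html)
        self.word_counts[tag].update(tokens)
        self.doc_counts[tag] += 1
        self.vocab.update(tokens)

    def predict(self, html):
        tokens = tokenize(html)
        total_docs = sum(self.doc_counts.values())
        best_tag, best_score = None, float("-inf")
        for tag in self.doc_counts:
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(self.doc_counts[tag] / total_docs)
            denom = sum(self.word_counts[tag].values()) + len(self.vocab)
            for token in tokens:
                score += math.log((self.word_counts[tag][token] + 1) / denom)
            if score > best_score:
                best_tag, best_score = tag, score
        return best_tag


# Toy training set (invented for illustration only).
tagger = NaiveBayesTagger()
tagger.train("<h1>Soccer scores</h1><p>The team won the match.</p>", "sports")
tagger.train("<p>Tennis match results and player rankings</p>", "sports")
tagger.train("<p>New laptop processor benchmarks</p>", "technology")
tagger.train("<p>Software update improves processor performance</p>", "technology")

print(tagger.predict("<p>The player won the tennis match</p>"))
print(tagger.predict("<p>Processor benchmarks for the new laptop</p>"))
```

A real course project would likely work with a much larger labeled collection and richer features; this sketch only shows the shape of the train-then-tag loop.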


Similar resources

MLeXAI: BIOMEDICAL TERM CLASSIFICATION

Machine Learning is an important area of Artificial Intelligence which is generally applicable to almost any field of science. Early exposure of students to the potential of machine learning could have a positive impact on their attitude towards Artificial Intelligence in particular and computer science in general. In this paper, we present a semester long machine learning project that was inco...


Recognizing American Sign Language Letters: A Machine Learning Experience in an Introductory AI Course

This paper describes a class project to introduce machine learning topics to an introductory artificial intelligence course as part of the MLExAI Project. The project’s topic was taken from the area of computer vision, specifically the use of principal component analysis for image classification. As a project within their AI class, students developed programs in the GNU Octave programming envir...


Final Project Report: Real Time Tennis Match Prediction Using Machine Learning

This project adopts an innovative data model by combining both historical match data and real-time stats, and applies machine learning to predict tennis match outcomes. Specifically, we explore and compare four data models mixing historical data and real-time stats, while applying machine learning techniques such as logistic regression, support vector classification (SVC) with linear, RBF and pol...


Fault diagnosis in a distillation column using a support vector machine based classifier

Fault diagnosis has always been an essential aspect of control system design. This is necessary because of the growing demand for increased performance and safety of industrial systems. The support vector machine classifier is a new technique based on statistical learning theory and is designed to reduce structural bias. Support vector machine classification in many applications in v...


Learning Document Image Features With SqueezeNet Convolutional Neural Network

The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...




Publication date: 2007